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The Schrodinger-HJW Theorem 

K. A. Kirkpatrick 

JVew Mexico Highlands University, Las Vegas, New Mexico 87701* 

A concise presentation of Schrodinger's ancilla theorem (1936 Proc. Camb. Phil. Soc. 32, 446) 
and its several recent rediscoveries. 

1. Introduction 

We re-present a theorem which Schrodinger proved in 1936. He commented that this 
theorem was one "for which I claim no priority but the permission of deducing it in the 
s ! ■ following section, for it is certainly not well known." His comment was amusingly prescient: 

J> ' The theorem was rediscovered by Jaynes in 1957 (whose work was extended by Hadjisavvas 

(1981)), rediscovered by Hughston, Jozsa, and Wootters (HJW) in 1993 (this last an expan- 
sion of a 1989 partial rediscovery by Gisin); in 1999, Mermin simplified a portion of HJW's 
proof — and it would appear none of these were aware of Schrodinger's work. Furthering 
the irony, Mermin commented that this is "a pertinent theorem which deserves to be more 
widely known." 

^C) ' But not only more widely known; this theorem deserves treatment in terms of physically 

^ | relevant ancillae (following Mermin) rather than formal transformations by orthonormal- 

column matrices, and it deserves a complete statement and a concise proof in one place. 
Here is our attempt at such a presentation. 

O ■ 

in : 

£T) '. 2. Preliminaries 

o : 

, Throughout, Tt s and Tt M are Hilbert spaces with dimensions n$ and hjk, respectively, 

CL,; Tr M { • } is the trace over H M of an operator on H s ® H M , | W SM ) is a vector in H s ® H M , 

and p s = Tr M { | ^ SM )(^) SM | }. The dimension of the support of p s is n p , n p <n s . 

Lemma. | \) and | <f>) are vectors in TL S ®Ti. M . If Tim {\x)(x\} = T^M { I 4')( < f > \}> then 
^ ' there exists a unitary transformation U on Ti M such that \x) — (l ® U)| <fi) . 

Proof: The operator X = Tr^vi { | X ) \ X I } i s positive and Hcrmitian, so its eigenvalues are 
non- negative and its eigenkets { \ pj) }™f 1 are an orthonormal basis of 7i s ; thus 

'x : 

5-h ' Wj > 0, 1 < j < n < n$, 

X = > w s \p s )(p s I, with " (1) 

~[ Wj = 0, J > n. 

For { | N ) } any basis of H M , \ X > = E^i ££i At \P.Ht)\ setting \r, s ) = E^i At | Mt ) , 
we have | X ) = X^=i I P^Vs ) ■ Then 

ns n 

X = \Ps)(Ps< |(v I r? a ) = J! Ws \Pb)(Ps \, (2) 
s s— 1 

whence (r) s i \ r) s ) — w s 5 ss i. Hence the { | bj) = | r)j )/y/ w j}"_ 1 are orthonormal and we may 

write |x) = Y^=i \f u ^s\Psb s ) ■ A similar argument leads to \<p) = Ej=i \f w H\Pi c j)i 
the { | Cj ) } orthonormal. Extend the sets { | bj ) } and { | Cj ) } to orthonormal bases of 
Tt M and set U = \°s)(c s |; clearly, U is unitary and 1 ® U performs the desired 

transformation. □ 
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Definition (p s -cnscmble) . Given a positive, Hermitian, unit trace operator p s , a p s - 
ensemble of order n (n > n p ) is a set { (| <frj ) G H s , itfj > 0) } with X^"=i Ws = 1 an< ^ 
the { | 0j : ) }™ =1 noncollinear, such that = ^™ =1 w s | </>s )( <t>s I- We will call a p 5 -ensemble 
linearly independent if the { | ^ ) }™ =1 are linearly independent. 

Definition (Ancilla). A set { | bj } G H M }™ =1 is an ancilla of the p 5 -ensemble 
{ (| <f>j ) G H s , Wj > 0) }™ =1 iff it is an orthonormal set and \q SM ) = £™ =1 S | S 6 S ), 
|^| 2 = Wj. 

Definition (U-map). The p 5 -ensemble { (\ipk), v k) } k - 1 is U-mapped to the p 5 -ensemble 
{ (| <pj ), Wj) }™ =1 iff there exists a set 

n 

{u jk eC\Y^ u tku* tk , = 5 kk , } |™ fe 7=(i4) ( 3 ) 



t=l 



such that 



jkj> = 5Z«j*V«t|^t>[ • ( 4 ) 



k t=i 

A unitary transformation U on 7i M generates this U-map iff { ujk = ( bj |U| b k ) Y(j'k)L(i i) > 
with { | 6 j ) G j.™^™' 1 ™) ancilla of the greater-order p 5 -ensemble. 



3. The Theorem 



Theorem (Schrodingcr-HJW). The state of S®M is \^ SM ); p s = Tr M { \^ SM ) (V s M I}- 
Then 

(a) . Every p s -ensemble of order n < um has a corresponding ancilla in Tt M . If the p s - 
ensemble is linearly independent, the ancilla is unique. 

(b) . Every orthonormal basis of H M contains exactly one ancilla corresponding to exactly 
one p s -ensemble. 

(c) . Given any two p s -ensembles, there exist unitary transformations on H M which generate 
a U-map from one to the other; if each p s -ensemble is linearly independent, the transforma- 
tion is unique. 

(d) . Every unitary transformation on H M generates a U-map of every p s -ensemble to 
another. 

(e) . Every vector in the support of p s appears as an element of at least one p s -ensemble. 
Proof: 

(a) . Given the p 5 -ensemble { (| 4>j )> Wj) }™_ l5 use an arbitrary orthonormal set 
{ | dj ) G Ti M to construct the vector | <£' ) = X)"=i <t>s \ 4>s d s )• By the Lemma, there 
exists a unitary transform U on H M such that 

n 

\* SM )= (l 5 ®U)|* / )=^0 s |</» s )®U|d s ); (5) 

8=1 

the ancilla is the orthonormal set { U | dj ) }™ =1 - Clearly, if the { | <f>j ) } are linearly inde- 
pendent, that ancilla is unique. 

(b) . { | b k ) } is an arbitrary orthonormal basis of Tt M ; expand | ^ SM ) in terms of it and 
any basis {\ Pj )} oiH s : \^> SM ) = £"=i Yl=i *jk\Pj b k ). Define <j> k \ fa ) = E^i a jk \ Pj ) 
and Wk = \<pk\ ; an d re-order the index k so Wk > 0, k < n < riMi w k — 0, k > n; then 

n 

|* 5 "> = 5>-| (6) 

3=1 
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Thus { | bj ) }™ =1 is an ancilla, and { (| cfij ), Wj) is the corresponding p^-ensemble. Sup- 
pose the set { | bj ) }™^ contained two ancillae, say { | bj ) | j <E .4 } and { | ) | fc e } , with 
A,Bd{l...n M }: 

\^ sM ) = j2^\^ s b s ) = j2^\^b t ). (?) 

se.4 teB 

The orthonormality of the { | b j ) } guarantees A = B, hence the uniqueness of the ancilla 
within { | bj ) }, and it guarantees \<j)j) — \ tpj ) and <fij — ipj for all j in A, hence the 
uniqueness of the p^-ensemble corresponding to that ancilla. 

(c). Given two /^-ensembles { (| <j)j ), Wj) and { (| ipk), v k ) }J? =1) (a) guarantees ancil- 
lae { | bj ) e H M }™ 1 and { | c fc ) e H M }™ =1 , respectively, with 

m n 

\^ sM ) = j2v^\4>sb s ) = j2v^\Ac t ) (8) 

s=l t=l 

(phases are absorbed into the ancillary kets). Extend { | b 3 ■) } and { | Ck ) } to orthonor- 
mal bases of H M ; the unitary operator U = Y^t=i \ c t)(bt | transforms | c k ) = U | bk ) = 
YT s =i u sk\ b s ), where u jk = ( bj | U | b k ) = (bj | c k ), so 



s=l ^ t=l ' 



(9) 



• t= 

Equating (8) and (9), we obtain the U-generated U-map 



±^u M = \f M ^{ 1 ;- m > (io) 

£rj 10 otherwise. 

If each p 5 -ensemble is linearly independent, each ancilla is unique, so the transformation 
between them is unique (modulo element-label permutation). 

(d). Given any /Aensemble, (a) guarantees an ancilla; this may be extended to an or- 
thonormal basis of TL M . Any unitary transformation maps this to another orthonormal 
basis; by (b), this second basis contains a single ancilla correlated to a single p 5 -ensemble. 
By Eq. (10), this unitary transformation provides the U-mapping between these two p s - 
ensembles. 

(c) . | £ ) is an arbitrary vector in the support of p s ; we will construct a p 5 -ensemblc with 
| £ ) as its first element. The Schmidt expression of |>I'' s - A/( ), Y^l=i V'sl Ps a s ), gives a basis 
for the expansion our arbitrary vector: |£) — Y^l=ils\Ps) > with Y^=i Its 1 2 — 1- F° r anv 
nj^ x riM unitary matrix Uj k we have 

n 

\^ SM ) = ^4>t\(l>tbt), whem 4>j\4>j) = Y J ^sU s j\Ps) sjid\b k ) = Y^ u *sk\o. s )- (11) 

t s—l s 

Let us find Uj k such that the first element of this p 5 -ensemble is our arbitrary vector, that 
is, \ = Then 

n \ 71 

I'M = ^ls\p s ) = —^2ip s u sl \p s ), (12) 

s=l <?>1 s=l 

whence u s i = </>i7 s /Vv Unitarity requires J2 S \ u si\ 2 = 1 = 4>i J2 S lls/i's] 2 - Thus we have 



i&i>=5>:ii°»>. with %1 -( 7 ,M)/^E:=ii7 S Mi 2 - as) 

s 

Arbitrarily complete the orthonormal set { | b k ) jfe^, which, by (b), must contain an ancilla 
of a p 5 -ensemble containing | £ ) . □ 
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4. Overview of the several treatments 

Schrodinger (1936) established our parts (a), (d), and (e). 

Jaynes (1957, p. 173) attempted to establish our (c) and (d). He required, for a particular 
matrix A and a transformation T, the equality AA^ = ATT^A^, which he incorrectly 
interpreted to require 1 the unitarity of T. Thus Jaynes failed to establish his (correct) 
claim of isomorphism between the group of unitary transformations and the group of U- 
mappings of p^-ensembles. Hadjisavvas (1981), extending Jaynes, elegantly obtained our 
(c), (d), and (e) for infinite dimensions. It appears that none of the later authors were 
aware of either Jaynes' or Hadjisavvas's results. 

Gisin (1989) established a weak (and unphysical — see below) version of our (a). 

HJW establish our (c) and (d) in their theorem's parts (b) and (a), respectively, and 
our (a) in the "application" in their Section 3.3. (The term "p 5 -ensemble" is due to HJW; 
Jaynes called it an "array".) 

The "GHJW Theorem" presented by Mermin (1999) is essentially HJW's Section 3.3 
(i.e., our (a)). Our proofs of the lemma and of part (a) of the theorem are quite similar 
to the development in the appendix of Mermin (1999). (In personal correspondence, David 
Mermin gives credit for what I've formalized as the Lemma to a conversation with Chris 
Fuchs.) 

Our (b) ("every complete variable of M. determines exactly one p 5 -ensemble" ) seems to 
have its first explicit statement here. 

We see that this theorem is primarily Schrodinger's — not only by priority, but by having 
established the major portion, parts (a), (d), and (e). Hadjisavvas and HJW added part (c). 
It is fairly well-known by the initials "HJW"; taking all this into account, and letting the 
"H" work a little harder, we arrive at "Schrodinger-HJW" as a priority-recognizing name. 



5. Comments and Discussion 

The U-map (orthogonal-columns matrix transformation) approach of Schrodinger and 
HJW, though it successfully generates all p -ensembles, has no obvious physical significance. 
Mermin, by treating the ancillary system M. as a real physical system with \fy SM ) as the 
actual quantum- mechanical state of S M (and utilizing Fuchs' "lemma"), was able to 
obtain an arbitrary /^-ensemble's ancilla (HJW's Section 3.3) without using the U-maps of 
HJW's theorem. This makes it possible to express the HJW theorem in terms of unitary 
transformations between observables in the ancillary system (our (c) and (d)). Instead of the 
mathematical formality of the U-maps, we have the physical significance of the correlation 
of each ancillary variable with a p^-ensemble. 

In Gisin's treatment, \^> SM ) is arbitrarily imposed: The state vector of the joint system 
is constructed in the proof (as was the | of our proof), not assumed given (as was the 
|,j)SA^ Q f our theorem). However, it is necessary that the two systems were prepared ab 
initio in a pure joint state (i.e., |\& ^)) which reduces to the specified mixture; Gisin's 
"steering" theorem, as presented, is not physical. Schrodinger and HJW make this clear; 
Mermin does also, in the body of his paper. However, in his appendix, Mermin states that, 
for any mixed state of Alice's system, it is always "possible to provide Bob with a system 
of his own for which the joint Alice-Bob system has a pure state" which reduces to Alice's 
mixture. As expressed, this statement may mislead: For example, if Alice's system is mixed 
because it is already entangled in a pure joint state with Carol's system, then no such system 
may be provided to Bob (except by stealing Carol's). That is, it is not possible to introduce 
the ancillary system post facto — it and the system must be initialized together. 

To Schrodinger, the central point was that u in general a sophisticated experimenter can, 
by a suitable device which does not involve measuring non-commuting variables, produce 
a non-vanishing probability of driving the system into any state he chooses" by means 



1 This equality is satisfied by TTt = 1 + X with AXAt = 0. 
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of measurement on the entangled ancilla. Unfortunately, Schrodinger's phrase "driving 
the system into any state he chooses" tends to overwhelm the modifier thereof, "a non- 
vanishing probability," so Schrodinger's point tends to be misstated; for example, Jammer 
(1974, p. 221) summarizes Schrodinger's statement "an experimenter can indeed steer a far- 
away system, without interfering with it at all, into any state out of an infinity of possible 
states. . . ," leaving an incorrect impression of deterministic control of the outcome. 

The Schrodinger-HJW Theorem gives a complete catalog of potential correlations between 
the p 5 -ensembles of S and the disjoint (orthogonal) sets of states of M, in the case that 
S © M. was prepared in a pure state. This catalog allows the design of an experiment 
involving a measurement on M. which makes exactly one p -ensemble "visible" — that is, 
allows every occurrence of S to be assigned a particular state of the p-ensemble without 
"disturbing" S. To be more precise, the measurement 2 of an ancilla (say, an observable 
B of M. with eigenkets { | bj ) }, the ancilla of the /Aensemble { (\(fij), Wj) }) changes the 
state of S 8 M from the pure state \^ SM ) = ^/wle 18 " \<j> a b a ) to the mixture p S(BM = 
^2 s w s \(j) s b s )(<f) s b s \. The correlation of this particular ancilla with this particular p s - 
ensemble is unique 3 (Kirkpatrick, 2002) — none of the other putative correlations in the 
Schrodinger-HJW theorem survive the measurement of the ancilla. 
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2 A "measurement" of B is an interaction of M with X (a system external to S © M.) such that | b 3 ; £ ) > 

| bj Xj }, the { | Xj ) € H x } orthonormal; the state of S ffi M changes from pure to mixed as a direct result 
of this unitary transformation on H s ®7~t M ® H x . So long as S © M remains isolated, not only do the 
ancilla and p-ensemble remain correlated, but also the { | b 3 • ) } maintain their correlation with disjoint 
elements of the exterior (initially the {\xj)}), regardless of further external interactions with X; von 
Neumann's infinite regression of measurements is irrelevant to our discussion. 

3 The correlation is unique even in the case of degenerate probabilities (e.g., EPR). 



